Gracefully Degrading Systems Using the Bulk-Synchronous Parallel Model with Randomised Shared Memory
نویسندگان
چکیده
The Bulk-Synchronous Parallel Model, BSPM, was proposed as a bridging model for parallel computation by Valiant. By using Randomised Shared Memory, RSM, this model o ers an asymptotically optimal emulation of the PRAM[1]. By using the BSPM with RSM, we show how a gracefully degrading massively parallel system can be obtained through: memory duplication to ensure global memory integrity, and to speed up the recon guration; a global recon guration method that restores the logical properties of the system, after a fault occurs. We assume fail-stop processors, single faults, no spare processors, and no signi cant loss of network throughput as a result of faults. Work done during recon guration is shared equally among the live processors, with minimal coordination. The overhead of the scheme and the graceful degradation achieved depend on the program being executed. We evaluate the recon guration, overhead, and graceful degradation of the system experimentally.
منابع مشابه
On the performance of parallel algorithms on a gracefully degrading system achieved through the BSPM
The Bulk Synchronous Parallel Model (BSPM) proposed by Valiant [10] can be used to construct a gracefully degrading system, as described in [7]. We show that by using the BSPM as a cost model we can estimate an upper bound on the overhead introduced to achieve a gracefully degrading system. The e ect of processor failures on the performance of such a system is considered in terms of the paramet...
متن کاملThe design and analysis of bulk-synchronous parallel algorithms
The model of bulk-synchronous parallel (BSP) computation is an emerging paradigm of general-purpose parallel computing. This thesis presents a systematic approach to the design and analysis of BSP algorithms. We introduce an extension of the BSP model, called BSPRAM, which reconciles shared-memory style programming with e cient exploitation of data locality. The BSPRAM model can be optimally si...
متن کاملNestStepModelica - Mathematical Modeling and Bulk-Synchronous Parallel Simulation
The majority of parallel computing applications are used for simulation of complex engineering applications and/or for visualization. To handle their complexity, there is a need for raising the level of abstraction is specifying such applications using high level mathematical modeling techniques, such as the Modelica language and technology. However, with the increased complexity of modeled sys...
متن کاملManaging irregular remote accesses to distributed shared arrays in a bulk-synchronous parallel programming environment
NestStep is a parallel programming language for the BSP (bulk-synchronous parallel) programming model. In this paper we describe the concept of distributed shared arrays in NestStep and its implementation on top of MPI. In particular, we describe a novel method for runtime scheduling of irregular, direct remote accesses to sections of distributed shared arrays. Our method, which is fully parall...
متن کاملNestStep: Nested Parallelism and Virtual Shared Memory for the BSP model
NestStep is a parallel programming language for the BSP (bulk–synchronous–parallel) model of parallel computation. Extending the classical BSP model, NestStep supports dynamically nested parallelism by nesting of supersteps and a hierarchical processor group concept. Furthermore, NestStep adds a virtual shared memory realization in software, where memory consistency is relaxed to superstep boun...
متن کامل